Internet Phone Trailer

ABSTRACT

A training system is disclosed. The training system includes a server. The server includes a processor, data storage, a data input/output interface, and a voice interface. The data input/output interface is configured to provide a web-based interface to a data terminal and to receive, from the data terminal and via the web-based interface, both an identifier identifying at least one voice terminal and at least one response. The voice interface is configured to transmit at least one audio test signal corresponding to at least one audio test to the at least one voice terminal. The response is a response to the at least one audio test. In one embodiment, the voice terminal and the data terminal are combined in a single device.

CROSS-REFERENCE TO RELATED APPLICATION

This is a continuation of U.S. patent application Ser. No. 13/277,944filed on Oct. 20, 2011, the contents of which are hereby incorporated byreference.

BACKGROUND

Cochlear implants may provide a person having sensorineural hearing losswith the ability to perceive sound by stimulating the person's auditorynerve via an array of electrodes implanted in the person's cochlea. Anexternal component of the cochlear implant detects sound waves, whichare converted into a series of electrical stimulation signals deliveredto the implant recipient's auditory nerve via the array of electrodes.Stimulating the auditory nerve in this manner may enable the cochlearimplant recipient's brain to perceive a hearing sensation that issimilar to the natural hearing sensation delivered to the auditorynerve.

Cochlear implants thus offer cochlear implant recipients anapproximation of natural hearing. However, there remain many challengesfor cochlear implant recipients associated with using cochlear implants.Among these challenges is using the telephone.

For a cochlear implant recipient, telephone use is difficult for severalreasons. First, speech heard over the phone can sound different thanspeech heard in person, meaning in some cases the cochlear implantrecipient may be hearing unfamiliar sounds. Second, many cochlearimplant recipients rely on lip-reading and other visual cues from aspeaker to bolster their understanding of the hearing sensation providedby the cochlear implant. When using the telephone, however, such visualcues are typically not possible. Finally, the position of the telephonespeaker and/or coil in relation to the microphone or telecoil associatedwith the cochlear implant can have a significant effect on the qualityof the sound as perceived by the cochlear implant recipient, and itoften takes some experimentation to determine the optimal position forvarious telephones and environments. Between unfamiliar sounds, a lackof visual cues, and difficulty in positioning, telephone use may becomefrustrating and undesirable for many cochlear implant recipients.

Some cochlear implant recipients have found that, with practice, theycan learn to confidently use the telephone. In many cases, this practiceinvolves the cochlear implant recipient enlisting the help of one ormore people, such as relatives, friends, and/or clinicians. The peoplemay speak to the cochlear implant recipient over the telephone or in amock setting, so as to allow the cochlear implant recipient to practiceinterpreting speech heard over the telephone. Such practice can bebeneficial to the cochlear implant recipient, in particular because thepeople can interact with and provide feedback to the cochlear implantrecipient.

While such practice may be effective, it may also be time consuming forthe people enlisted to help the cochlear implant recipient. In somecases, the people may not always be available to aid the cochlearimplant recipient in practicing. Further, in some cases, the people maynot know how or may not be able to provide speech at a level of clarityand complexity that is appropriate for the cochlear implant recipient.Still further, the cochlear implant recipient may wish to improve his orher confidence before using the telephone with another person.

SUMMARY

Disclosed herein is a training system for training a cochlear implantrecipient or other hearing prosthesis recipient to use a telephone. Thetraining system includes a voice terminal, a data terminal, and aserver. The voice terminal includes (i) an input interface configured toreceive at least one audio test signal from the server and (ii) anoutput interface including a transducer and configured to output atleast one audio test corresponding to the audio test signal. The dataterminal includes (i) a display configured to display a web-basedinterface provided by the server, (ii) a user input interface configuredto receive both an identifier identifying the voice terminal and atleast one response to the at least one audio test output by the voiceterminal, and (iii) a data output interface configured to transmit boththe identifier and the at least one response to the server. The serverincludes a data input/output interface configured to (i) provide theweb-based interface to the data terminal and (ii) to receive, from thedata terminal and via the web-based interface, both the identifier andthe at least one response. The server also includes a voice interfaceconfigured to transmit the at least one audio test signal to the voiceterminal. In some embodiments, the identifier is a telephone number.

In some embodiments, the at least one audio test comprises at least oneprerecorded spoken phrase. The phrase may be one word or a series ofwords. In this embodiment, the at least one response may be a textualinput of at least a portion of the prerecorded spoken phrase.Alternatively or additionally, the at least one response may be aselection of a text version of the prerecorded spoken phrase from aplurality of available text versions of phrases. Still alternatively oradditionally, the at least one response may be a request to repeat theprerecorded spoken phrase. Still alternatively or additionally, the atleast one response may be an indication of how well a user heard theprerecorded spoken phrase. Still alternatively or additionally, the atleast one response may be an indication of whether the prerecordedspoken phrase was the same as or different than a previous prerecordedspoken phrase. Still alternatively or additionally, the at least oneresponse may be a selection of a graphical image corresponding to theprerecorded spoken phrase.

In some embodiments, the audio test comprises audio output at aparticular frequency. In these embodiments, the at least one responsemay be an indication of how well a user heard the audio testcorresponding to the audio test signal.

In some embodiments, the data terminal user input interface is furtherconfigured to (i) receive custom text associated with a personalizedaudio test and (ii) transmit the custom text to the server. In theseembodiments, the data input/output interface may be further configuredto receive the custom text and to convert the custom text to the audiotest signal.

In some embodiments, the voice terminal and the data terminal arecombined in a single physical device. In other embodiments, the voiceterminal and the data terminal may be different physical devices.

A server for use in a training system is also disclosed. The serverincludes a processor, data storage, a data input/output interface, and avoice interface. The data input/output interface is configured toprovide a web-based interface to a data terminal and to receive, fromthe data terminal and via the web-based interface, both an identifieridentifying at least one voice terminal and at least one response. Thevoice interface is configured to transmit at least one audio test signalcorresponding to at least one audio test to the at least one voiceterminal. The at least one response is a response to the at least oneaudio test.

In some embodiments, the server further includes audio test logicexecutable by the processor to (i) select the at least one audio test,(ii) determine whether the at least one response is correct orincorrect, and/or (iii) store the at least one response in the datastorage. In some embodiments, the audio test logic is further executableby the processor to (i) generate training results based on at least oneof (i-a) the at least one audio test, (i-b) a difficulty of the at leastone audio test, and (i-c) the at least one response, and (ii) displaythe results via the web-based interface.

Training methods are also disclosed. In some embodiments, the methodincludes a server providing a web-based interface to a data terminal andthe server receiving, from the data terminal and via the web-basedinterface, an identifier identifying at least one telephone destination.The method further includes (i) the server transmitting at least oneaudio test signal corresponding to at least one audio test to the atleast one telephone destination and (ii) the server receiving, from thedata terminal and via the web-based interface, at least one response tothe at least one audio test.

In some embodiments, the method further includes, after transmitting theat least one audio test signal to the at least one telephonedestination, the server displaying, via the web-based interface, animage related to the at least one audio test. Alternatively oradditionally, the server may display a light indicating a pattern ofsounds in the at least one audio test. Still alternatively oradditionally, the server may display a visual representation of a pairof lips speaking the at least one audio test.

In some embodiments, the method further includes the server determiningwhether the at least one response is correct or incorrect. In someembodiments, responsive to determining that the at least one response isincorrect, the server retransmits the audio test signal to the at leastone telephone destination. Alternatively or additionally, responsive todetermining that the at least one response is incorrect, the server maydisplay a hint via the web-based interface. Still alternatively oradditionally, responsive to determining that the at least one responseis incorrect, the server may select an additional audio test. In someembodiments, the selection is based on the at least one audio test.Alternatively or additionally, the selection is based on a difficulty ofthe at least one audio test. Alternatively or additionally, theselection is based on the at least one response.

In some embodiments, the method further includes the server recordingthe at least one response. In some embodiments, the method still furtherincludes the server (i) generating training results based on the atleast one response and the at least one audio test and (ii) displayingthe training results via the web-based interface.

These as well as other aspects, advantages, and alternatives will becomeapparent to those of ordinary skill in the art by reading the followingdescription, with reference where appropriate to the accompanyingdrawings. Further, it should be understood that the description providedby this document, including this summary section, is intended to explainthe present system, server, and method by way of example only and thatnumerous variations may be possible.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1 a-b show overviews of networks in which an example trainingsystem may be implemented in accordance with some embodiments.

FIG. 2 shows an overview of an example voice terminal in accordance withsome embodiments.

FIG. 3 shows an overview of an example data terminal in accordance withsome embodiments.

FIG. 4 shows an overview of an example server in accordance with someembodiments.

FIG. 5 is a flow chart illustrating a training method in accordance withsome embodiments.

FIG. 6 is a simplified communication flow diagram illustrating atraining method in accordance with some embodiments.

FIGS. 7 a-g show example web-based interfaces in accordance with someembodiments.

DETAILED DESCRIPTION

The following detailed description describes various features andfunctions of the disclosed systems and methods with reference to theaccompanying figures. In the figures, similar symbols typically identifysimilar components, unless context dictates otherwise. The illustrativesystem and method embodiments described herein are not meant to belimiting. Certain aspects of the disclosed systems and methods can bearranged and combined in a wide variety of different configurations, allof which are contemplated herein.

1. Example Training System

FIG. 1 a shows an overview of a network in which an example trainingsystem may be implemented in accordance with some embodiments. It shouldbe understood, of course, that this and other arrangements and functionsdescribed herein are provided by way of example only and that numerousvariations may be possible. For instance, elements can be added,omitted, combined, distributed, reordered, repositioned, or otherwisechanged while remaining within the scope of the invention as defined bythe claims. Further, it should be understood that various functionsdescribed herein can be carried out by hardware, firmware, and/orsoftware (e.g., one or more processors programmed with machine languageinstructions to carry out the functions).

As shown, network 100 includes a voice terminal 102 coupled to acircuit-switched network 104. Network 100 also includes a data terminal106 and a server 110 coupled to a packet-switched network 108.Additionally, network 100 includes a gateway 112 connected between thecircuit-switched network 104 and the packet-switched network 108.

Voice terminal 102 may be coupled to the circuit-switched network 104through a switch or other connection. In an embodiment, thecircuit-switched network 104 may be the Public Switched TelephoneNetwork (PSTN) and the switch may be a standard telephone switch. Otherexamples are possible as well.

For example, as shown by the dotted line extending between voiceterminal 102 and the packet-switched network 108, voice terminal 102 mayalso be coupled to the packet-switched network 108 through, for example,a proxy or other connection. In some embodiments, the packet-switchednetwork 108 may be the Internet, and the proxy may be a Session InitialProtocol (SIP) proxy, such as a Call Session Control Function (CSCF). Inthese embodiments, voice terminal 102 may be configured to place andreceive communications over the packet-switched network, such as Voiceover Internet Protocol (VoIP) communications.

As shown by the dotted line extending between voice terminal 102 anddata terminal 106, voice terminal 102 may also be coupled to thepacket-switched network 108 through data terminal 106 and one or moreother devices, such as an analog adapter. In this embodiment as well,voice terminal 102 may be configured to place and receivecommunications, such as VoIP communications, over the packet-switchednetwork. Voice terminal 102 is further described below in connectionwith FIG. 2.

Data terminal 106 may be coupled to the packet-switched network 108through one or more wired or wireless links. Data terminal 106 isfurther described below in connection with FIG. 3.

Server 110 may include one or more servers configured to carry out theserver functions described herein. Server 110 may be coupled to thepacket-switched network 108 through one or more wired or wireless links.Server 110 is further described below in connection with FIG. 3.

As shown in FIG. 1 a, network 100 further includes a gateway 112 that isconnected to both the circuit-switched network 104 and thepacket-switched network 108. Gateway 112 may be any entity configured tointerface between circuit-switched network 104 and packet-switchednetwork 108. In general, gateway 112 may be configured to translatebetween protocols used in the circuit-switched network, such as theSignaling system #7 (SS7) protocol, and protocols used in thepacket-switched network, such as SIP.

As shown in FIG. 1 a, network 100 may be accessed by a user 114. Asindicated by the dotted line extending between user 114 and voiceterminal 102, user 114 may use voice terminal 102 to initiate andreceive communications via one or both of the circuit-switched network104 and the packet-switched network 108. Similarly, as indicated by thedotted line extending between user 114 and data terminal 106, user 114may use data terminal 106 to access the packet-switched network 108. Theuser may be a hearing prosthesis recipient (such as a cochlear implantrecipient, for example), or may be any other kind of user.

Network 100 may additionally include elements instead of or in additionto those shown. For example, network 100 may additionally include one ormore base transceiver stations, base station controllers, mobileswitching centers, packet data serving nodes, home agents, and/orauthentication, authorization, and accounting servers for allowingwireless communication among voice terminal 102, data terminal 106, andserver 110.

While in FIG. 1 a voice terminal 102 and data terminal 106 are shown asseparate elements, in an embodiment voice terminal 102 and data terminal106 may be combined in a single device. FIG. 1 b shows an overview of anetwork in which an example training system may be implemented inaccordance with some embodiments. As shown in FIG. 1 b, network 100includes a combined voice and terminal 116 coupled to each of thecircuit-switched network 104 and the packet-switched network 108. Ingeneral, combined voice and data terminal 116 may be any deviceconfigured to connect to both the circuit-switched network 104 and thepacket-switched network 108, such as a smartphone, personal digitalassistant (PDA), laptop computer, tablet computer, or other device. Insome embodiments, combined voice and data terminal 116 may be configuredto connect to the circuit-switched network 104 using one or moreapplications running on combined voice and data terminal 116, such asSkype™, FaceTime™, Google Talk™, or another application.

FIG. 2 shows an overview of an example voice terminal in accordance withan embodiment. Voice terminal 200 may be, for example, a telephone. Insome embodiments, voice terminal 200 may be a landline telephoneconnectable to a circuit-switched network (e.g., the PSTN) via one ormore wired links. In other embodiments, voice terminal 200 may be alandline telephone connectable to a packet-switched network (e.g., theInternet) via one or more wired links and an analog adapter. In stillother embodiments, voice terminal 200 may be a cellular telephoneconnectable to a circuit-switched network (e.g., the PSTN) via awireless connection and one or more base stations and/or other accessnetwork entities. In yet other embodiments, voice terminal 200 may be anInternet Protocol telephone connectable to a packet-switched network(e.g., the Internet) via a wired or wireless connection. In stillfurther embodiments, voice terminal 200 may be a device configuredspecifically to operate within the disclosed training system. Othertypes of voice terminals are possible as well.

In any case, voice terminal 200 is identifiable by a unique orsemi-unique identifier such as a PSTN telephone number (E.164), mobileidentification number (MIN), mobile directory number (MDN),international mobile subscriber identifier (IMSI), username, emailaddress, or the like, which can be used to establish a communicationsession with voice terminal 200. The voice terminal 200 may share asemi-unique identifier with one or more other voice terminals, such aswhen the voice terminal is one of multiple landline telephones in a homeor office. Other examples are possible as well.

As shown, voice terminal 200 includes an input interface 202 and anoutput interface 204 connected together by a system bus or othermechanism 206.

Input interface 202 is configured to receive at least one audio testsignal from a server. To this end, input interface 202 may include awired or wireless interface through which voice terminal 200 may beconnected, either directly or indirectly, to a network, such that voiceterminal 200 and the server are coupled via one or more networks (e.g.,wireless networks, the PSTN, packet networks, etc. as described withrespect to FIGS. 1 a and 1 b).

Output interface 204 includes a transducer 208 and is configured tooutput at least one audio test corresponding to the audio test signalreceived at input interface 202. To this end, output interface 204includes a transducer 208, which may be, for example, an electroacoustictransducer or speaker. The audio test may be any audible sound, such asa prerecorded spoken phrase, spoken by a human or machine voice, oraudio output at a particular frequency. In some cases the audio testsmay be prerecorded phrases spoken by friends or relatives. Other audiotests are possible as well.

Voice terminal 200 may additionally include one or more elements insteadof or in addition to those shown. As an example, voice terminal 200 mayadditionally include one or more user input interfaces such as a keypadfor receiving input from a user, and/or a display for displayinginformation to a user. Other examples are possible as well.

FIG. 3 shows an overview of an example data terminal in accordance withan embodiment. Data terminal 300 may be, for example, a computer orother electronic device. In an embodiment, data terminal 300 may becombined with voice terminal 300 into a single device, such as a smartphone, personal digital assistant (PDA), or a device configuredspecifically to operate within the disclosed training system.

As shown, data terminal 300 includes a display 302, a user inputinterface 304, and a data output interface 306 connected together by asystem bus or other mechanism 308.

Display 302 is configured to display a web-based interface provided by aserver. The web-based interface may be, for example, a website providedby the server and accessed by the data terminal 300. Alternately, theweb-based interface may be an application or program supported by theserver and running on the data terminal 300. The application may be, forexample, a mobile application or other software designed to run on thedata terminal 300. The application may be pre-stored on the dataterminal 300, or may be downloaded from, for example, the server via,for example, the packet-switched network. In any case, the applicationmay be configured to support a wireless connection between the dataterminal 300 and the server. Display 302 may be any type of display,such as flat panel, liquid crystal, plasma, or other display. Exampleweb-based interfaces are shown in FIGS. 7 a-g and are described below.

User input interface 304 is configured to receive an identifieridentifying the voice terminal as well as at least one response to theaudio test output by the voice terminal. To this end, user inputinterface 304 may include one or more of a keyboard, mouse, joystick,trackball, touchscreen, microphone, or other input device. A user mayuse user input interface 304 to input the identifier and the response.Once the response is received through user input interface 304, anindication of the response may be displayed on the web-based interfaceshown on the display 302. Example web-based interfaces with responsesare shown in FIGS. 7 a-f and are described below.

Data output interface 306 is configured to transmit the identifier andthe response to the server. To this end, data output interface 306 mayinclude a wired or wireless interface through which data terminal 300may be connected, either directly or indirectly, to a packet-switchednetwork, such that data terminal 300 and the server are coupled via atleast the packet-switched network.

Data terminal 300 may additionally include one or more elements insteadof or in addition to those shown. As an example, data terminal 300 mayadditionally include one or more interfaces for connecting to one ormore peripheral devices, such as a printer, scanner, or other device.Other examples are possible as well.

FIG. 4 shows an overview of an example server in accordance with anembodiment. Server 400 may be, for example, a computer or a plurality ofcomputers on which one or more programs and/or applications are run inorder to provide one or more web-based interfaces that are accessiblevia a packet-switched network. As noted above, the web-based interfacesmay be, for example, websites provided by the server 400 and accessibleby a data terminal and/or applications or programs supported by theserver 400 and executable on the data terminal. In particular, server400 may be, for example, a web server configured to deliver HypertTextMarkup Language (HTML) documents to one or more clients, such as thedata terminal, via the packet-switched network in response to a request,such as a HyperText Transfer Protocol (HTTP) request, from the dataterminal.

As shown, server 400 includes a data input/output interface 402, a voiceinterface 404, a processor 406, and data storage 408 connected by asystem bus or other mechanism 410.

Data input/output interface 402 is configured to provide the web-basedinterface to the data terminal and to receive, from the data terminaland via the web-based interface, both the identifier identifying thevoice terminal and the response to the audio test. To this end, datainput/output interface 402 may include a wired or wireless interfacethrough which server 400 may be connected, either directly orindirectly, to a packet-switched network, such that server 400 and thedata terminal are coupled via at least the packet-switched network.

Voice interface 404 is configured to transmit at least one audio testsignal to the voice terminal. To this end, voice interface may alsoinclude a wired or wireless interface through which server 400 may beconnected, either directly or indirectly, to a packet-switched network.In some embodiments, data input/output interface 402 and voice interface404 may be combined.

As shown in FIG. 1, server 400 may be connected to the voice terminal inseveral ways, depending on the voice terminal. In some embodiments, thevoice terminal may be a landline or cellular telephone connected to acircuit-switched network (e.g., the PSTN) via one or more wired links.In these embodiments, server 400 may be connected to a gateway via thepacket-switched network, and the gateway may be connected to the voiceterminal via the circuit-switched network.

In other embodiments, the voice terminal may be a landline telephoneconnected to a packet-switched network (e.g., the Internet) via one ormore wired links and/or an analog adapter. In some embodiments, server400 may be connected to the analog adapter via the packet-switchednetwork, and the analog adapter may be connected to the voice terminal.

In yet other embodiments, the voice terminal may be an Internet Protocoltelephone connected to a packet-switched network (e.g., the Internet)via a wired or wireless connection. In this embodiment, server 400 maybe connected to the voice terminal via the packet-switched network.

Processor 406 may comprise one or more general purpose processors (e.g.,Intel™ microprocessors) and/or dedicated processors (e.g., digitalsignal processors). Data storage 408, in turn, may comprise one or moremagnetic, optical, organic, or other storage components, whethervolatile or non-volatile. In an embodiment, data storage 408 may beexternal to server 400, such as in a local database directly connectedto server 400 or in a remote database accessible by server 400 via theInternet.

Data storage 408 may include a database of audio test signalscorresponding to a plurality of audio tests. Some audio tests may beavailable to all users of the training system, while some audio testsmay be customized for particular users. For example, data storage 408may include personalized audio tests made by converting custom textreceived from a user. As another example, data storage 408 may includepersonalized audio tests recorded by friends or relatives of aparticular user for the user. Other examples are possible as well.Additionally, data storage 408 may include stored training results for aplurality of users. Additionally, data storage 208 may include audiotest logic 412. In some embodiments, audio test logic 412 may also beseparate from data storage 408.

In an embodiment, audio test logic 412 is executable by processor 406 toselect the audio test (and thus the corresponding audio test signal) tosend to the data terminal. Further, audio test logic 412 is executableby processor 406 to determine whether the response to the audio test iscorrect or incorrect. Still further, audio test logic 412 is executableby processor 406 to store the response in data storage, such as datastorage 408. In an embodiment, audio test logic 412 is also executableby processor 406 to (i) generate training results based on one or moreaudio tests, (ii) a difficulty of the audio tests, and/or (iii)responses to the audio tests. Further, audio test logic 412 is alsoexecutable by processor 406 to display the training results via theweb-based interface so that a user may view the training results on thedata terminal displaying the web-based interface. The training resultsmay thus be accessible to a user of the data terminal via the web-basedinterface. In some embodiments, the training results may additionally beaccessible to, for example, a clinician or other party via a web-basedinterface at another terminal. Audio test logic 412 may additionally beexecutable by processor 406 to carry out any of the server functionsdescribed herein. Server 400 may additionally include one or moreelements instead of or in addition to those shown. As an example, server400 may additionally include one or more user input interfaces such as akeypad for receiving input from a user, and/or a display for displayinginformation to a user. Other examples are possible as well.

As described in connection with FIGS. 1-4, the voice terminal isconfigured to receive audio test signals from the server and outputaudio tests corresponding to the audio test signals. In some cases, thevoice terminal may additionally be configured to receive audio inputfrom a user and to transmit the audio input to the server. Audio inputfrom the user may be used by the server to, for example, determinefrequencies that are best and least heard by the user. Other uses of theaudio input are possible as well.

Thus, as shown, the disclosed training system includes a voice terminal,a data terminal (which may be implemented in the same device as thevoice terminal), and a server. Interactions between each of the voiceterminal, the data terminal, and the server may be better understoodwith reference to FIGS. 5 and 6.

2. Example Training Method

FIG. 5 is a flow chart illustrating a training method in accordance withan embodiment. Method 500 may include one or more operations, functions,or actions as illustrated by one or more of blocks 502, 504, 506, and/or508. Although the blocks are illustrated in a sequential order, theseblocks may also be performed in parallel, and/or in a different orderthan those described herein. Also, the various blocks may be combinedinto fewer blocks, divided into additional blocks, and/or eliminatedbased upon the desired implementation.

Method 500 begins at block 502 where the server provides a web-basedinterface to the data terminal. Example web-based interfaces are shownin FIGS. 7 a-g and are described below.

At block 504, the server receives, from the data terminal and via theweb-based interface, an identifier identifying at least one telephonedestination. The telephone destination may be, for example, a voiceterminal such as the voice terminal described above in connection withFIG. 2. The identifier may be any unique or semi-unique identifier, suchas a telephone number.

At block 506, the server transmits at least one audio test signalcorresponding to at least one audio test to the telephone destination.Depending on the telephone destination, the server may transmit theaudio test signal directly to the telephone destination via apacket-switched network, or the audio test signal may be routed throughone or more packet-switched networks, circuit-switched networks,gateways, and/or adapters.

The audio test may take several forms including, for example, aprerecorded spoken phrase or audio output at a particular frequency. Inan embodiment, the audio test may be a personalized audio test createdusing custom text received from a user. In particular, the data terminalmay be configured to receive custom text from a user and may transmitthe custom text to the server. The server may convert the custom text toa personalized audio test using, for example, a speech synthesis systemrunning a text-to-speech or phonetics-to-speech program. Other examplesare possible as well. In another embodiment, the audio test may be apersonalized audio test recorded by, for example, a friend or relativeof the user.

At block 508, the server receives, from the data terminal and via theweb-based interface, at least one response to the audio test. Dependingon the audio test, the response may take several forms. For example, inan embodiment, the audio test may be a prerecorded spoken phrase, andthe response may be one or more of textual input of at least a portionof the prerecorded spoken phrase, a selection of a text version of theprerecorded spoken phrase from a plurality of available text versions ofphrases, a request to repeat the prerecorded spoken phrase, anindication of how well a user heard the prerecorded spoken phrase, anindication of whether the prerecorded spoken phrase was the same as ordifferent than a previous prerecorded spoken phrase, and a selection ofa graphical image corresponding to the prerecorded spoken phrase. Asanother example, in another embodiment, the audio test may be audiooutput at a particular frequency, and the response may be an indicationof how well the user heard the audio test. Other examples are possibleas well.

In an embodiment, after or substantially simultaneously with block 506,the server may display, via the web-based interface, a visual cuerelated to the audio test, such as one or more of an image related tothe audio test, a light indicating a pattern of sounds in the audiotest, and a visual representation of a pair of lips speaking the audiotest. Each of these visual cues may aid a user in understanding theaudio test output by the voice terminal. In some cases, these visualcues may be optional, such that a user may opt to show or not show them.In other cases, the visual cues may be used in early audio tests, butnot in later tests. Other cases are possible as well.

In an embodiment, the method 500 may additionally include the serverdetermining whether the response is correct or incorrect. In some cases,responsive to determining that the at least one response is incorrect,the server may retransmit the audio test signal to the telephonedestination, either immediately or at a later time, in order to allowthe user to re-hear the audio test. Alternatively or additionally, insome cases, responsive to determining that the at least one response isincorrect, the server may display a hint via the web-based interface.The hint may include, for example, any of the visual cues describedabove. Alternatively or additionally, the hint may include a portion ofa word or words spoken in the audio test. Other hints are possible aswell. Alternatively or additionally, responsive to determining that theat least one response is incorrect, the server may select an additionalaudio test based on one or more of (i) the previous audio test, (ii) adifficulty of the previous audio test, and (iii) the response to theprevious audio test. For example, the server may select an additionalaudio test that tests similar skills, sounds, words, and/or frequenciesas the previous audio test, and/or the server may select an additionalaudio test that is easier than the previous audio test, and/or theserver may select an additional audio test that tests similar skills,sounds, words, and/or frequencies as those in the response to theprevious audio test. Other examples are possible as well.

In an embodiment, the method 500 may additionally include the serverrecording the at least one response. Further, the method 500 may includethe server generating training results based on the response and theaudio test and displaying the training results via the web-basedinterface. The training results may include, for example, a history ofaudio tests taken by the user, scores on various audio tests, agraphical representation of the user's training progress over time, acomparison of the user's scores with compiled scores of other users,and/or an indication of particular skills the user needs to work on.Other examples are possible as well. An example of a web-based interfacedisplaying training results is shown in FIG. 7 g and is described below.In an embodiment, the method 500 may additionally include the serverdisplaying one or more tips on telephone usage via the web-basedinterface. For example, the server may display tips on how to position atelephone (or other voice terminal or sound processor) next to the ear,or on how to adjust one or more settings on the telephone, such as asensitivity of the transducer. As another example, the server maydisplay a recommendation for clinical follow-up in the event thatcertain sounds or frequencies are repeatedly misheard. The server may,for example, display a report that the user may print out and bring tothe clinical follow-up. Alternately, the report may be accessible by aclinician at the clinical follow-up by accessing the server via, forexample, a web-based interface. The server may display other tips aswell. These tips may be general, or the tips may be selected by theserver based on one or more of the audio test, a difficulty of the audiotest, and the response.

In an embodiment, block 502 may be carried out continuously, and blocks504-508 may be triggered by receipt of the identifier from the dataterminal via the web-based interface. In an embodiment, once blocks504-508 are triggered, blocks 506 and 508 may be carried outcontinuously until another trigger occurs, for example when one or bothof the voice terminal and the data terminal disconnect from thepacket-switched network. Other examples are possible as well.

FIG. 6 is a simplified communication flow diagram illustrating atraining method in accordance with an embodiment. It is to be understoodthat while FIG. 6 assumes successful communication between each of thevoice terminal, the data terminal, and the server, in some cases one ormore of communications 600, 602, 604, 606, 608, 610, and 612 may beunsuccessfully transmitted. Each of the voice terminal, the dataterminal, and the server may be configured to retransmit communicationsin case of an unsuccessful transmission according to one or more knownprotocols.

As shown, with communication 600 the server provides a web-basedinterface to a data terminal. As noted above, the server maycontinuously provide the web-based interface, and at a particularinstance the data terminal may access the web-based interface. The dataterminal accessing the web-based interface may involve, for example, auser of the data terminal entering a website address into a web browseror running an application on the data terminal. In general, the dataterminal accessing the web-based interface involves the data terminalbeing in communication with the server via a packet-switched network,such as the Internet. Other examples are possible as well. In any case,once the data terminal has accessed the web-based interface, the dataterminal displays the web-based interface.

With communication 602, the data terminal receives, from the user, anidentifier identifying a voice terminal. For example, the data terminalmay receive a telephone number. The data terminal receiving theidentifier from the user may involve the user typing or otherwiseentering the identifier into the web-based interface as displayed by thedata terminal.

With communication 604, the server receives the identifier from the dataterminal via the web-based interface. Using the received identifier, theserver may initiate and establish a connection (not shown) with thevoice terminal.

Once the connection between the server and the voice terminal isestablished, the server transmits an audio test signal to the voiceterminal with communication 606. Upon receiving the audio test signal,the voice terminal outputs an audio test corresponding to the audio testsignal with communication 608.

The user of the data terminal is also a user of the voice terminal.Thus, the user may hear the audio test output by the voice terminal andmay, in response, input a response to the data terminal. Withcommunication 610, the data terminal receives the response from theuser. The data terminal receiving the response may involve the usertyping or otherwise entering the response into the web-based interfaceas displayed by the data terminal.

With communication 612, the server receives the response from the dataterminal via the web-based interface. Using the received response, theserver may, with communication 614, for example, (i) display theresponse via the web-based interface, (ii) display one or more visualcues or hints, (iii) indicate whether the response is correct orincorrect, (iv) retransmit the audio test signal to the voice terminal,(v) select an additional audio test signal, (vi) transmit the additionalaudio test signal to the voice terminal, (vii) record the response,(viii) generate training results for the user based on the response,and/or (ix) display training results for the user. Other examples arepossible as well.

After communication 614, one or more communications may be transmittedbetween the server and the data terminal and/or the server and the voiceterminal. In general, communications similar to communications 606, 608,610, 612, and 614 may continue until one or both of the data terminalbecomes disconnected from the server.

3. Example Web-Based Interface

FIGS. 7 a-g show an example web-based interface in accordance with anembodiment. As noted above, the web-based interface may be, for example,a website provided by a server and accessible by a data terminal.Alternately, the web-based interface may be an application or programsupported by the server and running on the data terminal.

It is to be understood that each of FIGS. 7 a-g is merely illustrativeand is not meant to be limiting. The web-based interface may includefeatures and functions instead of and/or in addition to those shown.

FIG. 7 a shows an example web-based interface prompting a user to enteran identifier 702 and a name 704. As shown, the user (John Smith) hasentered a telephone number as well as his name. With the telephonenumber, the server may establish a communication session with the voiceterminal identified by the identifier. In an embodiment, instead of aname the user may be prompted to enter a predetermined useridentification of some kind or email address, and in some cases mayadditionally be prompted to enter a password. Other examples arepossible as well.

FIG. 7 b shows an example web-based interface prompting a user to entera response to an audio test. The audio test may be a prerecorded spokenphrase output by the voice terminal. For example, the voice terminal mayhave output the prerecorded phrase “I went to the grocery store today.”Upon hearing the audio test, the user may select a text version of theprerecorded spoken phrase from a plurality of available text versions706, as shown. Additionally, the user has the option to return to aprevious question 708 or to move on to the next question 710. Further,the user may be able to request a repeat 712 of the audio test. In anembodiment, when the user requests a repeat 712, the data terminal maytransmit to the server a request to repeat the audio test. Uponreceiving the request, the server may retransmit the audio test signalto the voice terminal, and the voice terminal may re-output the audiotest.

FIG. 7 c shows an example web-based interface prompting a user to entera textual input 714 of a prerecorded spoken phrase. The audio test maybe the prerecorded spoken phrase output by the voice terminal. Forexample, the voice terminal may have output the phrase “On Tuesday, itwill be partly cloudy.” Upon hearing the audio test, the user may enterthe textual input 714 of at least a portion of the prerecorded spokenphrase. In an embodiment, in determining whether the response to theaudio test (textual input 714) is correct or incorrect, the server maybe configured to ignore or accept minor variations in the textual input714, such as varying capitalization and punctuation or minor spellingerrors. In an embodiment, a threshold is used. For example, a textualinput 714 that is 95% correct (i.e., having 5% errors) may be determinedto be correct by server 400. Other examples are possible as well.

In addition to the textual input 714, the web-based interface includes avisual clue or hint 716. In an embodiment, the visual clue or hint 716may be displayed at the request of the user, responsive to adetermination that a response to the audio test was incorrect, or foranother reason.

FIG. 7 d shows an example web-based interface prompting a user toindicate how well the user heard an audio test. The audio test may be aprerecorded spoken phrase and/or audio output at a particular frequency.As shown, the user is presented with a first plurality of responses 718with which to indicate a clarity of the audio test. Similarly, the useris presented with a second plurality of responses 720 with which toindicate a volume of the audio test. Other indications of how well theuser heard the audio test, besides clarity and volume, are possible aswell.

FIG. 7 e shows an example web-based interface prompting a user to selecta graphical image corresponding to a prerecorded spoken phrase. Theaudio test may be the prerecorded spoken phrase output by the voiceterminal. For example, the voice terminal may have output the phrase“Today we are going to buy our Christmas tree.” Upon hearing the audiotest, the user may select the graphical image corresponding to theprerecorded spoken phrase, namely the graphical image of a tree 722.Other numbers and types of graphical images are possible as well.

FIG. 7 f shows an example web-based interface prompting a user toindicate whether a prerecorded spoken phrase was the same as ordifferent than a previous prerecorded spoken phrase. The audio test mayinclude two prerecorded spoken phrases output in succession by the voiceterminal. For example, the voice terminal may have output the phrase“Today is Saturday” and then output the phrase “Today is Sunday.” Uponhearing the prerecorded spoken phrases, the user may select “Same” or“Different” to indicate whether the prerecorded spoken phrases were thesame or different.

FIG. 7 g shows an example web-based interface displaying trainingresults for a user. The training results may be generated by the serverbased on one or more audio tests, responses to the one or more audiotests, and/or training results from one or more additional users. Asshown, the web-based interface includes a graphical representation of auser's audio test scores 724 and average audio test scores for aplurality of users 724 shown over time. In addition to the trainingresults shown, the training results may include various other kinds oftraining results, such as a history of audio tests taken by the user,scores on various audio tests, a graphical representation of the user'straining progress over time, and/or an indication of particular skillsthe user needs to work on. Other training results are possible as well.

The features and functions shown in FIGS. 7 a-g may be modified,combined, adjusted, added to, replaced, or otherwise varied to createseveral variations, all of which are contemplated herein.

4. Alternate Embodiments

In a first alternate embodiment, a network in which an example trainingsystem may be implemented may include a voice terminal, a data terminal,and a server. The voice terminal may be coupled to the server via one orboth of a circuit-switched network and a packet-switched network.Further, the data terminal may be coupled to the server via apacket-switched network.

The data terminal may be configured to display a client-based interface,such as an application or program supported running on the dataterminal. The application may be, for example, a mobile application orother software designed to run on the data terminal and may beconfigured to support a wired or wireless communications connectionbetween the data terminal and the server. Further, the data terminal mayinclude a processor and data storage that includes a database of audiotest signals corresponding to a plurality of audio tests, responses tothe audio test signals, and training results, which may take any of theforms described above. Additionally, the data storage may include audiotest logic executable by the processor to select an audio test (and thusthe corresponding audio test signal) to display at the data terminal(e.g., via an application running on the data terminal), receive aresponse to the audio test (e.g., via the application running on thedata terminal) and to determine whether the response to the audio testis correct or incorrect. Further, the audio test logic may be executableby the processor to store the response in the data storage. Stillfurther, the audio test logic may be executable by the processor todisplay the training results at the data terminal (e.g., via theapplication running on the data terminal) for viewing by a user of thedata terminal and/or a clinician or other party.

The server may include a data interface configured to receive an audiotest signal from the data terminal and a voice interface configured totransmit the audio test signal to the voice terminal. The voice terminalmay include an input interface configured to receive the audio testsignal and an output interface (including, e.g., a transducer)configured to output to a user the audio test corresponding to the audiotest signal received at input interface, as described above.

In a second alternate embodiment, a network in which an example trainingsystem may be implemented may include a voice terminal and a dataterminal. The data terminal may be coupled to the voice terminal via awired (e.g., Universal Serial Bus) and/or wireless (e.g. Bluetooth)connection. The data terminal may be configured to perform all of thefunctions of the data terminal in the first alternate embodiment, andmay further be configured to transmit an audio test signal directly tothe voice terminal. The voice terminal may be configured to perform allof the functions of the voice terminal in the first alternateembodiment, and may be further configured to receive a test signaldirectly from the voice terminal.

While various aspects and embodiments have been disclosed herein, otheraspects and embodiments will be apparent to those skilled in the art.The various aspects and embodiments disclosed herein are for purposes ofillustration and are not intended to be limiting, with the true scopebeing indicated by the following claims.

What is claimed is:
 1. A server comprising: a processor; data storage;and at least one interface component, wherein: (i) the server receivesfrom a data terminal an identifier of a voice terminal, (ii) based onthe received identifier, the server (a) transmits to the identifiedvoice terminal an audio test and (b) transmits to the data terminal aset of responses to the audio test, (iii) the server receives from thedata terminal a selection of one of the responses in the set ofresponses, and (iv) the server determines whether the selection iscorrect.
 2. The server of claim 1, wherein the identifier is a telephonenumber.
 3. The server of claim 1, wherein the server receives with theidentifier at least one of a user name, an email address, or a password.4. The server of claim 1, wherein the set of responses comprises atleast two textual phrases.
 5. The server of claim 1, wherein the set ofresponses comprises at least two images.
 6. The server of claim 1,wherein: the set of responses comprises a first subset of textualphrases and a second subset of textual phrases, wherein each textualphrase in the first subset of textual phrases corresponds to a level ofclarity of the audio test, and wherein each textual phrase in the secondsubset of textual phrases corresponds to a volume of the audio test; andthe at least one response includes a first response selected from thefirst set of textual phrases and a second response selected from thesecond set of textual phrases.
 7. The server of claim 1, wherein theaudio test signal includes a first spoken phrase, and wherein the set ofresponses comprises a first option and a second option, wherein: (a) thefirst option is indicative of the first spoken phrase being similar to asecond spoken phrase included in a previous audio test signal, and (b)the second option is indicative of the first spoken phrase beingdifferent than the the second spoken phrase.
 8. The server of claim 1,wherein the server is configured to provide to the data terminal a firstweb-based interface and a second web-based interface, wherein the firstweb-based interface includes a display of a field for entering theidentifier, and wherein the second web-based interface includes adisplay of the set of responses.
 9. A server comprising: a processor;data storage; at first component configured to provide a web-basedinterface; a second component configured to transmit at least one of aplurality of audio test signals, wherein each audio test signalcorresponds to an audio test included in a plurality of audio tests; andaudio test logic stored in the data storage, wherein the audio testlogic is executable by the processor to: (i) select an audio test fromthe plurality of audio tests and identify an audio test signal includedin the plurality of audio test signals, wherein the identified audiotest signal corresponds to the selected audio test, (ii) provide theweb-based interface to a data terminal, (iii) receive from the dataterminal a first input entered via the web-based interface, wherein thefirst input includes an identifier identifying a voice terminal; (iv)transmit to the identified voice terminal the identified audio testsignal; (v) transmit to the data terminal at least one image related tothe selected audio test, wherein the at least one image is displayed onthe web-based interface, (vi) receive, from the data terminal, a secondinput entered via the web-based interface, wherein the second inputincludes a response to the audio test, and (vii) determine whether theresponse is correct.
 10. The server of claim 9, wherein the at least oneimage comprises two or more images, and wherein the response isindicative of a selection of one of the two or more images.
 11. Theserver of claim 9, wherein the response to the audio test is a textinput.
 12. The server of claim 11, wherein the at least one imagecomprises a hint to the selected audio test.
 13. The server of claim 9,wherein the identifier is a telephone number.
 14. A server comprising:at least one component configured to communicate with a voice-and-dataterminal; a processor; data storage; and audio test logic stored in thedata storage, wherein the audio test logic is executable by theprocessor to: (i) provide a web-based interface to the voice-and-dataterminal; (ii) receive from the voice-and-data terminal an identifieridentifying the voice-and-data terminal, wherein the identifier isentered in a field displayed on the web-based interface, (iii) based onthe identifier, transmit one or more one audio test signals to theidentified voice-and-data terminal, wherein each of the one or moreaudio test signals corresponds to an audio test included in a pluralityof audio tests, and (iv) for each audio test, (a) transmit to thevoice-and-data terminal at least one of a set of responses correspondingto the audio test or at least one image corresponding to the audio test,(b) receive from the voice-and-data terminal at least one responseentered via the web-based interface, and (c) determine whether each ofthe at least one response is correct.
 15. The server of claim 14,wherein the web-based interface comprises an application configured torun on the voice-and-data terminal.
 16. The server of claim 14, wherein,for at least one audio test, the set of responses corresponding to theat least one audio test includes two or more textual phrases.
 17. Theserver of claim 14, wherein, for at least one audio test, the set ofresponses corresponding to the at least one audio test includes at leasttwo images.
 18. The server of claim 14, wherein, for at least one audiotest, the set of responses corresponding to the at least one audio testincludes a first set of textual phrases and a second set of textualphrases, wherein each textual phrase in the first set of textual phrasescorresponds to a level of clarity of the audio test, and wherein eachtextual phrase in the second set of textual phrases corresponds to avolume of the audio test, and wherein the at least one response includesa first response selected from the first set of textual phrases and asecond response selected from the second set of textual phrases.
 19. Theserver of claim 14, wherein, for at least one audio test, the set ofresponses corresponding to the at least one audio test includes, the atleast one image comprises a hint to the at least one audio test, andwherein the at least one response to the at least one audio testcomprises a text input.
 20. The server of claim 14, wherein, for atleast one audio test, the at least one response comprises a selection ofa response included in the set of responses corresponding to the atleast one audio test.